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I discuss our current understanding of parton distributions. I begin with the underlying theoretical framework, and 
the way in which different data sets constrain different partons, highlighting recent developments. The methods of 
examining the uncertainties on the distributions and those physical quantities dependent on them is analysed. Finally 
1 look at the evidence that additional theoretical corrections beyond NLO perturbative QCD may be necessary, what 
type of corrections are indicated and the impact these may have on the uncertainties. 



1 Introduction 

The proton is described by QCD - the theory of the 
strong interactions. This makes an understanding of 
its structure a difficult problem. However, it is also 
a very important problem - not only as a question in 
itself, but also in order to search for and understand 
new physics. Many important particle colliders use 
hadrons HERA is an ep collider, the Tevatron is a 
pp collider, the LHC at CERN will be a pp collider, 
and an understanding of proton structure is essential 
in order to interpret the results. Fortunately, when 
one has a relatively large scale in the process, in prac- 
tice only > lGeV 2 , the proton is essentially made up 
of the more fundamental constituents - quarks and 
gluons (partons), which interact relatively weakly. 
Hence, the fundamental quantities one requires in the 
calculation of scattering processes involving hadronic 
particles are the parton distributions. These can be 
derived from, and then used within, the factorization 
theorem which separates processes into nonperturba- 
tive parts which can be determined from experiment, 
and perturbative parts which can be calculated as a 
power-series in the strong coupling constant as- 

This is illustrated in the canonical example of 
deep inelastic scattering. The cross-section for the 
virtual photon-proton interaction can be written in 
the factorized form 

a(ep -> eX) = £ Cf 13 (x, a s (Q 2 )) ® fi(x, Q 2 ) 



where Q 2 is the photon virtuality, x — 7^7, the mo- 
mentum fraction of parton (z/=energy transfer in the 
lab frame), and the fi(x, Q 2 ) are the parton distribu- 
tions, i.e the probability of finding a parton of type i 
carrying a fraction x of the momentum of the hadron. 
Corrections to the above formula are of 0(Aq CD /Q 2 ) 



and are known as higher twist. The parton distribu- 
tions are not easily calculable from first principles. 
However, they do evolve with Q 2 in a perturbative 
manner, satisfying the evolution equation 



df t (x,Q 2 ) 
dlnQ 2 



= Y j Pij{x^ s {Q 2 ))® f 3 {x,Q 2 ) 



where the splitting functions Pij{x, a s (Q 2 )) are cal- 
culable order by order in perturbation theory. The 
coefficient functions Cf (x, a s (Q 2 )) describing a hard 
scattering process are process dependent but are 
calculable as a power-series, i.e Cf(x,a s (Q 2 )) = 
Sfc Cf' ( x ) a s (Q 2 ). Since the fi(x,Q 2 ) are process- 
independent, i.e. universal, once they have been 
measured at one experiment, one can predict many 
other scattering processes. 

Global fits 1 ~~ 7 use all available data, largely 
structure functions, and the most up-to-date QCD 
calculations, currently NLO-in-a s (Q 2 ), to best de- 
termine these parton distributions and their conse- 
quences. In the global fits input partons are param- 
eterized as, e.g. 

xf(x, Ql) = (1 - xf{\ + ex - 5 + 1X )x s 

at some low scale Qq ~ 1 — 5GeV 2 , and evolved up- 
wards using NLO evolution equations. Perturbation 
theory should be valid if Q 2 > 2GeV 2 , and hence 
one fits data for scales above 2 — 5GeV 2 , and this cut 
should also remove the influence of higher twists, i.e. 
power-suppressed contributions. 

In principle there are many different parton dis- 
tributions - all quarks and antiquarks and the glu- 
ons. However, rn c ,mb ^> Aqcd (and top does not 
usually contribute), so the heavy parton distribu- 
tions are determined perturbatively. Also we usu- 
ally assume s = s, and that isospin symmetry holds, 
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i.e. p — > n leads to d(x) — > and — > 
This leaves 6 independent combinations. Relating s 
to 1/2 (u + d) we have the independent distributions 



involves contributions to the coefficient functions ~ 
(Q 2 ) m 2n_1 (l — x) and higher twist contributions 



are known to be enhanced as i ^ 1. Hence, in order 
to to avoid contamination of NLO theory one makes 



u v = u-u, d v = d-d, sea = 2* (u + d + a), d-u, g. & ^ ^ 2 = q2{1/x _ 1) + m 



15GeV^ 



It is also convenient to define X = uy + dy + sea + 
(c+c) + (b+b). There are then various sum rules con- 
straining parton inputs and which are conserved by 
evolution order by order in as, i.e. the number of up 
and down valence quarks and the momentum carried 
by partons (the latter being an important constraint 
on the gluon which is only probed indirectly), 

/ xT,(x) + xg(x) dx = 1. 
Jo 

When extracting partons one needs to consider 
that not only are there 6 independent combinations, 
but there is also a wide distribution of x from 0.75 
to 0.00003. One needs many different types of ex- 
periment for a full determination. The sets of data 
usually used are: HI and ZEUS F^{x,Q 2 ) data 8 ' 9 
which covers small x and a wide range of Q 2 ; E665 
F%> d (x, Q 2 ) data 10 at medium x; BCDMS and SLAC 
F%' d (x,Q 2 ) data 11 ' 12 at large x; NMC F^ d (a 



at medium and large x; CCFR F 2 

7 v(v)p 



(X, 



,2)13 

and 



F^^ w (x, Q 2 ) data 14 at large x which probe the sin- 



glet and valence quarks independently; ZEUS and 
H1 F Lharm( x >Q 2 ) data 15 ' 16 ; E605 P N - M /2 + 
X 17 constraining the large x sea; E866 Drell-Yan 
asymmetry 18 which determines d — u; CDF W- 
asymmetry data 19 which constrains the u/d ratio at 
large x; CDF and DO inclusive jet data 20 ' 21 which tie 
down the high x gluon; and NuTev Dimuon data 22 
which constrain the strange sea. 

The determination of the different partons in 
given kinematic ranges can be split into a few differ- 
ent classes. We begin with large x. Here the quark 
distributions are determined mainly from structure 
functions, which are dominated by non-singlet va- 
lence distributions. Both the evolution of these 
non-singlet distributions and conversion to structure 
functions is quite simple involving no parton mixing 



df Nb (x,Q 2 ) 
d\nQ 2 

7 NS 



= P NS (x,a s (Q*)) 



F^{x,Q 2 )=C Nb (x,a s {Q 2 )) 



f NS {x,Q 2 ) 
f NS {x,Q 2 ). 



Hence, the evolution of high x structure functions is a 
good test of the theory and of as(Q 2 )- The success 
is shown in Fig. 1. However - perturbation theory 
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Figure 1. Description of large x BCDMS and SLAC measure- 
ments of . 

The extension to very small x has been made 
in the past decade by HERA. In this region there 
is very great scaling violation of the partons from 
the evolution equations and also interplay between 
the quarks and gluons. At each subsequent order 
in as each splitting function and coefficient function 
obtains an extra power of ln(l/x) (some accidental 
zeros in P gg ), i.e. Pij(x, a s (Q 2 )), Cf (x, a s (Q 2 )) ~ 
a™(Q 2 ) ln m_1 (l/x), and hence the convergence at 
small x is questionable. The global fits usually as- 
sume that this turns out to be unimportant in prac- 
tice, and proceed regardless. The fit is good, but 
could be improved. The large ln(l/x) terms mean 
that small x predictions are somewhat uncertain, 
as will be discussed later. Small x parton distribu- 
tions are therefore an interesting field of study within 
QCD. They are also vital for understanding the stan- 
dard production processes at the LHC, and perhaps 
some of the more exotic ones, as shown in Fig. 2, 
which demonstrates the range of x probed by the 
experiment. 
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LHC parton kinematics 
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x 12 = (M/14TeV) exp(±y) 
Q = M 




Figure 2. The range of x probed at HERA and the LHC as a 
function of the hard scale, e.g. particle mass, and rapidity. 
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Figure 3. E866 fit to their Drell-Yan data as a function 
(quark) and X2 (antiquark). 
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The high-x sea quarks are determined by 
Drell-Yan data (assuming good knowledge of the va- 
lence quarks). There is new precise data from the 
E866/NuSea collaboration 23 , and their fit to these 
data shows a discrepancy with existing partons im- 
plying larger high-x valence quarks, as shown in 



Fig. 4. However, the fit performed by MRST (Fig. 4) 
and CTEQ displays no such discrepancy. 

E866 pd data and MRST2001 (x F > 0.45) 




* lu M(GeV) 
Figure 4. MRST fit to the E866 data for high values of x F . 

The s(x) and s(x) distributions are probed 
using CCFR and NuTeV dimuon data, i.e. the pro- 
cesses 



c(/i ). 



The quality of data is now such that one can exam- 
ine the s(x) and s(x) distributions separately. This 
has recently been performed in detail by CTEQ 24 . 
They find that s(x) < s(x) at quite small x, but 
since J (s(x) — s(x)) dx = 0, (zero strangeness num- 
ber) this leads to — + / x(s(x) — s(x)) dx = [S~] > 0, 
as demonstrated in Fig. 5. They obtain the rough 
constraint < [S~] < 0.004. This is particularly 
significant because NuTeV measure 25 



'NC 



'NC 



°cc — "cc 

and in the standard model this satisfies R~ = 
i - sin 2 9 W - (1 - | sin 2 6w)\y=\- There is cur- 
rently a 3ct discrepancy between this determination 
of sin 9 W and others 26 but [S~] = 0.002 reduces this 
anomaly from 3<r to 1.5o\ NuTeV themselves claim 
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Momentum Asymmetry 
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Figure 5. CTEQ strange momentum asymmetry (top) and 
number asymmetry (bottom). 

no such strange asymmetry when using partons ob- 
tained from fitting their own data 22 , so this is an 
issue which requires resolution. 

MRST also look at the effect of isospin 
violation 27 since R~ also depends on this - 
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2[V~] 

where [6U V ] = - [D% [5D V ] = [DP] - [£/?], and 
MRST use the simple parameterization 

U P(X) = d n v ( X ) + Kf(x), dP( X ) = U n v { X ) - Kf(x), 

where f(x) is a simple function maintaining required 
conservation laws. The dependence on k is shown 
in Fig. 6. The best fit value of k = —0.2 leads 
to a similar reduction of the NuTeV anomaly, i.e. 
Asin 2 9w ~ —0.002. But there is only a weak indi- 
cation of this value and a fairly wide variation in k 
is allowed. 

The best determination of the high-x gluon 
distribution comes from inclusive jet measure- 
ments by DO and CDF at Tevatron. They mea- 



Figure 6. Ax against the isospin violating parameter ft. 

sure da/dExdr) for central rapidity CDF or in bins 
of rapidity DO. At central rapidity the kinematic 
equality (at LO) is x — 2Et/\/s, and measurements 
extend up to E T — 400GeV(a; — 0.45), and down 
to Et ~ 60GeV(a; ~ 0.06). Gluon-gluon fusion 
dominates the hard cross-section, but g(x,[i 2 ) falls 
off more quickly as x — > 1 than q(x,n 2 ) so there 
is a transition from gluon-gluon fusion at small x, 
to gluon-quark to quark-quark at high x. However, 
as seen in Fig. 7 even at the highest x gluon-quark 
contributions are significant. Jet photoproduction at 
HERA will be another constraint in the future. 




100 200 300 400 500 600 700 
E T (GeV) 

Figure 7. Fractional contributions to jet cross-section from 
different parton-level contributions. 

The above procedure completely determines the 
parton distributions at present. The total fit is rea- 
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Table 1. Quality of fit to data for CTEQ6M. 



Data Set 


no. of data 


x 2 


HI ep 


230 


228 


ZEUS ep 


229 


263 


BCDMS p,p 


339 


378 


BCDMS p,d 


251 


280 


NMC up 


201 


305 


E605 (Drell-Yan) 


119 


95 


DO Jets 


90 


65 


CDF Jets 


33 


49 



sonably good and that for CTEQ6 2 is shown in Table 

1 for the large data sets. The total x 2 = 1954/1811. 
For MRST The total X 2 = 2328/2097 - but the er- 
rors are treated differently, and different data sets 
and cuts are used. The same sort of conclusion is 
true for other global fits 3 ~ 7 (which use fewer data). 
However, there are some areas where the theory per- 
haps needs to be improved, as we will discuss later. 

2 Parton Uncertainties 

2.1 Hessian (Error Matrix) approach 

In this one defines the Hessian matrix H by 

id 

H is related to the covariance matrix of the param- 
eters by Cij(a) = Ax 2 (-ff -1 )ij, and one can use the 
standard formula for linear error propagation, 

hi 

This has been employed to find partons with errors 
by Alekhin 5 , as seen in Fig. 8 and HI 6 (each with 
restricted data sets). 

The simple method can be problematic with 
larger data sets and larger numbers of parameters 
due to extreme variations in A\ 2 in different direc- 
tions in parameter space. This is solved by finding 
and rescaling the eigenvectors of H (CTEQ 28 ' 29 ' 2 ) 
leading to the diagonal form 

i 

The uncertainty on a physical quantity is given by 
(AF) 2 =^(F(S 1 W)-F(t ) )) 2 , 

i 

where S { t +) and S { ~ } are PDF sets displaced along 
eigenvector directions by a given A% 2 . Similar eigen- 
vector parton sets have also been introduced by 




Figure 8. Results for Alekhin partons at Q 2 = 9GeV 2 with 
uncertainties (solid lines), (dashed lines - CTEQ5M, dotted 
lines - MRST01). 



MRST 31 and ZEUS. However, there is an art in 
choosing the "correct" A\ 2 given the complication 
of the errors in the full fit 32 . Ideally A% 2 = 1 , but 
this leads to unrealistic errors, e.g. values of as(M§) 
obtained by CTEQ using A% 2 = 1 for each data set 
in the global fit are shown in Fig. 9, and are not con- 
sistent. CTEQ choose Ax 2 ~ 100, which is perhaps 
conservative. MRST choose A\ 2 ~ 50. An example 
of results is shown in Fig. 10. 

Ax 2 - 1 ranges from GA 
0.09 0.1 0.11 0.12 0.13 0.14 0.15 

-»- BCDMSp 
— •— BCDMSd 

— •— Hla 

• Hlb 

— •— ZEUS 

• NMCp 

« NMCr 

—»— CCFR2 
— •— CCFR3 
« E605 

» CDFw 
« E866 

-•- DOjet 
-•- CDFjet 

0.09 0.1 0.11 0.12 0.13 0.14 0.15 

alpha S 

Figure 9. Values of ctg(M^) and their uncertainties using 
Ax 2 = 1 from CTEQ. 
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Figure 10. Luminosity uncertainties using the CTEQ Hessian 
approach. 

2.2 Offset method 

In this the best fit and parameters ao are obtained 
using only uncorrelated errors. The quality of the 
fit is then estimated by adding uncorrelated and cor- 
related errors in quadrature. Roughly speaking sys- 
tematic uncertainties are determined by letting each 
source of systematic error vary by la and adding the 
deviations in quadrature. This procedure is used by 
ZEUS 7 , and leads to an effective A% 2 > 1. 

ZEUS 




Figure 11. The valence partons extracted by ZEUS from a 
global fit and a fit to their own data alone (with some input 
assumptions) . The latter illustrates a potential for a real con- 
straint from HERA data alone in the future. 



2.3 Statistical Approach 

In principle this involves the construction of an en- 
semble of distributions labelled by T each with prob- 
ability P({JF}), where one can incorporate the full 
information about measurements and their error cor- 
relations into the calculation of P({J-}). This is sta- 
tistically correct, and does not rely on the approx- 
imation of linear propagation errors in calculating 
observables. However, it is inefficient, and in prac- 
tice one generates N (N can be as low as 100) dif- 
ferent distributions with unit weight but distributed 
according to P({!F}) 4 . Then the mean \iq and devi- 
ation ao of an observable O are given by 

N N 

i i 
Currently this approach uses only proton DIS 
data sets in order to avoid complicated uncertainty 
issues, e.g. shadowing effects for nuclear targets, and 
also demands consistency between data sets. How- 
ever, it is difficult to find many truly compatible DIS 
experiments, and consequently the Fermi2001 par- 
tons are determined by only HI, BCDMS, and E665 
data sets. They result in good predictions for many 
Tevatron cross-sections, e.g. inclusive jets and W 
and Z total cross-sections. However, the restricted 
data sets mean there is restricted information - data 
sets are deemed either perfect or, in the case of most 
of them, useless - leading to unusual values for some 
parameters, e.g. as(M§) = 0.112±0.001 and a very 
hard dy (x) at high x (together these two features fa- 
cilitate a good fit to Tevatron jets independent of the 
high- a; gluon). These partons would produce some 
extreme predictions, as seen later. Nevertheless, the 
approach does demonstrate that the Gaussian ap- 
proximation is often not good, and therefore high- 
lights shortcomings in the methods outlined in the 
previous sections. It is a very attractive, but ambi- 
tious large-scale project, still in need of some further 
development. In particular I feel it requires the inclu- 
sion of a wider variety of data in order to overcome 
the obstacle presented by the fact that most data 
sets in the global fit are not really as consistent as 
they should be in the strict statistical sense. 

2-4 Lagrange Multiplier method 

This was first suggested by CTEQ 30 and has been 
concentrated on by MRST 31 . One performs the fit 
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while constraining the value of some physical quan- 
tity, i.e. one minimizes 

*(A,a)=x^ o6aJ (a) + AF(a) 

for various values of A. This gives a set of best fits for 
particular values of the quantity F(a) without rely- 
ing on the quadratic approximation for % 2 , as shown 
for aw in Fig'- 12. The uncertainty is then deter- 
mined by deciding an allowed range of A\ 2 ■ One can 
also easily check the variation in x 2 f° r each of the 
experiments in the global fit and ascertain if the to- 
tal A\ 2 is coming specifically from one region, which 
might cause concern. In principle, this is superior to 
the Hessian approach, but it must be repeated for 
each physical process. 

W production at the Tevatron 



X" increase in global analysis as the 

W and H cross sections are varied al the TEVATRON 
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Figure 12. X^i f,ai ^ or CTEQ plotted against aw 



2.5 Results 

I choose the cross-section for W and Higgs produc- 
tion at the Tevatron and LHC (for M H = 115GeV) 
as examples. Using their fixed value of t\g(M§) = 
0.118 and Ax 2 = 100 CTEQ obtain 

Act w (LHC) m ±4% Acr w (Tev) pa ±5% 

Acjff (LHC) fa ±5%. 

Using a slightly wider range of data, A^ 2 ~ 50 and 
a s {M 2 z ) = 0.119 MRST obtain 

Acr w (Tev) « ±1.2% Acr w (LHC) pa ±2% 



±2%. 



Acr ff (Tev) w ±4% Acr H (LHC) 

MRST also allow ag(M|) to be free. In this case 
Aa w is quite stable but A<jh almost doubles. Con- 
tours of variation in \ 2 f° r the predictions of these 
cross-sections are shown in Fig. 13. 



X mere. 
W and H crc 



;e in global analysis as the 

s sections are varied at the LHC 




Per cent change in W cross section 



Figure 13. x -plot for W and Higgs production at the Teva- 
tron (top) and LHC (bottom) with ag free (dashed) and fixed 
(solid) at a s = 0.119 

The same general procedure is also used by 
CTEQ 34 to look at the effect of new physics param- 
eterized by the contact term 

±{2TT/A 2 )(q Ll ^q L )(q L ^q L ). 

The curves in Fig. 14 show the fit to the DO jet data, 
which is the most discriminating data set, for A = 
1.6,2.0,2.4,00 TeV, and A = -1. For the highest 
values of A the fit even improves very slightly, but 
A > 1.6, TeV is clearly ruled out. 

Hence, the estimation of uncertainties due to ex- 
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Table 2. Values of a s (M|) and its error from different NLO QCD fits. 



Group 


Ax 2 


a 5 (Ml) 


CTEQ6 


Ax 2 = 


100 


a 3 (M 2 z ) 


ZEUS 




= 50 


<*.(M%) 


MRST01 


Ax 2 = 


20 


<x.(M%) 


HI 


A X 2 = 


= 1 


a s {M%) 


Alekhin 


Ax 2 = 


: 1 


a s {M%) 


GKK 


CL 




a s {M%) 



0.1165 ± 0.0065(exp) 

0.1166 ±0.0049(ea;p) ±0.0018 (mocfeZ) ±0.004(theory) 

0.1190 ± 0.002(exp) ±0.003{theory) 

0.115 ±0.0017(ea;p) ± (model) ±0.005 {theory) 

0.1171 ± 0.0015(exp) ±0.0033(theory) 

0.112 ± 0.001(exp) 




1.5<//<2 









Figure f4. Fits to DO jets for different values of A. 

perimental errors has many different approaches and 
different types and amount of data actually fit. Over- 
all the uncertainty from this source is rather small - 
only more than a few % for quantities determined 
by the high x gluon and very high x down quark. 
This is illustrated for the determinations of as(M§) 
in Table 2. There is generally good agreement, but 
their are some outlying values. 

These outlying values of as(M§) show that dif- 
ferent approaches can sometimes lead to rather dif- 
ferent central values, This suggests that there are 
other matters to consider as well as the experimen- 
tal errors on data. We also need to determine the 
effect of assumptions made about the fit, e.g. cuts 
made on the data, the data sets fit, the parameter- 
ization for input sets, the form of the strange sea, 
etc.. Many of these can be as important as the er- 
rors on the data used (or more so). This is demon- 
strated by the results from the LHC /LP Study Work- 
ing Group 35 shown in Tables 3, and by predictions for 
a w by MRST CTEQ and Alekhin 36 in Table 4. In 
both cases the discrepancies are mainly due to differ- 
ences in detailed constraints (by data) on the quark 
decomposition. Differences between predictions are 



Table 3. Cross sections for Drell-Yan pairs (e+e ) with 
PYTHIA 6.206, rapidity < 2.5. The errors shown are the PDF 
uncertainties. 



PDF set 



Comment | xscc [pb] | PDF uncertainty % 
81 < M < 101 GcV 



CTEQ6 


LHAPDF 


1065 


± 


46 


4.4 


MRST2001 


LHAPDF 


1091 


± 




3 


Fermi2002 


LHAPDF 


853 


± 


18 


2.2 



Table 4. Comparison of aw ■ Bi v for different partons. 



PDF set 


Comment 


xsec [nb] 


PDF uncertainty 


Alekhin 


Tevatron 


2.73 


± 0.05 (tot) 


MRST2002 


Tevatron 


2.59 


± 0.03 (expt) 


CTEQ6 


Tevatron 


2.54 


± 0.10 (expt) 


Alekhin 


LHC 


215 


± 6 (tot) 


MRST2002 


LHC 


204 


± 4 (expt) 


CTEQ6 


LHC 


205 


± 8 (expt) 



also shown by Fig. 15 - the predictions for W and 
Higgs production at the Tevatron from MRST2001 
and CTEQ6, and Fig. 16 - the comparison between 
the gluons for the two parton sets. 



X" increase in L'lohal analysis as the 

W and H cross sections are varied at the TEVATRON 



• CTEQ6 



Per cent change in W cross section 



Figure 15. \ 2 -^>\ot for W and Higgs production at the Teva- 
tron with as free. The predictions from CTEQ6 is marked. 
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Uncertainty of gluon from Hessian method 




Figure 16. Fractional uncertainty in the MRST gluon com- 
pared with the difference in the central CTEQ6 gluon. 

3 Theoretical errors 

3.1 Problems in the fit 

As well as the consequences of these assumptions we 
must consider the related problem of theoretical er- 
rors. Theoretical errors are indicated by some re- 
gions where the theory perhaps needs to be improved 
to fit the data better. There is a reasonably good fit 
to HERA data, but there are some problems at the 
highest Q 2 at moderate x, i.e. in dF 2 / dlnQ 2 , as seen 
for MRST and CTEQ in Fig. 17. Also the data re- 
quire the gluon to be valencelike or negative at small 
x at low Q 2 , e.g. the ZEUS gluon in Fig. 18, leading 
to Fl(x,Q 2 ) being negative 1 at the smallest x,Q 2 . 
However, it is not just the low x-low Q 2 data that 
require this negative gluon. The moderate x data 
need lots of gluon to get a reasonable dF 2 /d\nQ 2 
and the Tevatron jets need a large high x gluon, and 
this must be compensated for elsewhere. In general 
MRST find that it is difficult to reconcile the fit to 
jets and to the rest of the data, Fig. 19, and that 
different data compete over the gluon and as(M^). 
The jet fit is better for CTEQ6 largely due to their 
different cuts on other data. Other fits do not in- 
clude the Tevatron jets, but generally produce gluons 
largely incompatible with this data. 
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Figure 19. The MRST fit to DO jet data. The points show 
the range of the systematic errors. 



3.2 Types of Theoretical Error, NNLO 

It is vital to consider theoretical errors. These 
include higher perturbative orders (NNLO), small 
x (ajln n_1 (l/a;)), large x (ajln 2 ™- 1 ^ - x)) low 
Q 2 (higher twist), etc.. Note that renormaliza- 
tion/factorization scale variation is not a reliable 
method of estimating these theoretical errors because 
of increasing logs at higher orders. 
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Figure 20. LO, NLO and NNLO predictions for W and Z 
cross-sections. 

In order to investigate the true theoretical error 
we must consider some way of performing correct 
large and small x resummations, and/or use what 
we already know about NNLO. The coefficient func- 
tions are known at NNLO. Singular limits x — > 1, 
x — > are known for NNLO splitting functions as 
well as limited moments 37 , and this has allowed ap- 
proximate NNLO splitting functions to be devised 38 
which have been used in approximate global fits 39 . 
They improve the quality of fit very slightly (mainly 
at high x) and as(M§) lowers from 0.119 to 0.1155. 
The gluon is smaller at NNLO at low x due to 
the positive NNLO quark-gluon splitting function. 
There is also a NNLO fit by Alekhin 40 , with some 
differences - the gluon is not smaller, probably due 
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Figure 21. Comparison of the predictions for Fl(x, Q 2 ) at LO, 
NLO and NNLO using MRST partons and also a ln(l/i)- 
rcsummed prediction 42 . 

to the absence of Tevatron jet data in the fit and to 
a very different definition of the NNLO charm con- 
tribution. There is agreement in the reduction of 
a 5 (M|) at NNLO, i.e. 0.1171 -> 0.1143. 

Using these NNLO partons there is reasonable 
stability order by order for the (quark-dominated) 
W and Z cross-sections, as seen in Fig. 20. However, 
the change from NLO to NNLO is of order 4%, which 
is much bigger than the uncertainty at NLO due to 
experimental errors. Also, this fairly good conver- 
gence is largely guaranteed because the quarks are 
fit directly to data. There is greater danger in gluon 
dominated quantities, e.g. Fl(x, Q 2 ), as can be seen 
in Fig. 21. Hence, the convergence from order to 
order is uncertain. 

3.3 Empirical approach 

We can estimate where theoretical errors may be im- 
portant by adopting the empirical approach of inves- 
tigating in detail the effect of cuts on the fit quality, 
i.e. we try varying the kinematic cuts on data. The 
procedure is to change W^ ut , Q 2 ut and/or x cut , re-fit 
and see if the quality of the fit to the remaining data 
improves and/or the input parameters change dra- 
matically. (This is similar to a previous suggestion 
in terms of data sets 41 .) One then continues until 



the quality of the fit and the partons stabilize 27 . 

For W 2 ut raising from 15GeV 2 has no effect. 
When raising Q 2 cut from 2GeV 2 in steps there is 
a slow, continuous and significant improvement for 
Q 2 up to > lOGeV 2 (560 data points cut), sug- 
gesting that any corrections are probably higher or- 
ders not higher twist. The input gluon becomes 
slightly smaller at low x at each step (where one 
loses some of the lowest x data), and larger at high 
x. as(M§) slowly decreases by about 0.0015. Rais- 
ing x cu t leads to continuous improvement with sta- 
bility reached at x = 0.005 (271 data points cut) 
with as(Mg) — ► 0.118. There is an improvement 
in the fit to HERA, NMC and Tevatron jet data, 
and much reduced tension between the data sets. At 
each step the moderate x gluon becomes more pos- 
itive, at the expense of the gluon below the cut be- 
coming very negative and dF 2 (x, Q 2 )/d\nQ 2 being 
incorrect. However, higher orders could cure this in 
a quite plausible manner. For example adding higher 
order terms to the splitting functions 

'ln 3 (l/;r) \n 2 (l/xY 
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leaves the improved fit above x = 0.005 largely un- 
changed, but solves the problem below x = 0.005. 
Saturation corrections added to NLO and NNLO fits 
seem to make the situation worse. Hence, the cuts 
are suggestive of theoretical errors for small x and/or 
small Q 2 . Predictions for W and Higgs cross-sections 
at the Tevatron are still safe if x cu t = 0.005, since 
they do not sample partons at lower x. However, 
they change in a smooth manner as x cut is lowered, 
due to the altered partons above x cu t- 

There is a lot of work on explicit ln(l/ir)- 
resummations in structure functions and parton dis- 
tributions for example 42 ' 43 ' 44 , but there is no com- 
plete consensus on the best approach. There is 
also work on connecting the partons to alterna- 
tive approaches at small x, e.g. dipole models 45 , 
and pomerons 46 . These approaches can suggest im- 
provements to the fits and changes in predictions, 
e.g. a resummed prediction 42 for Fl(x, Q 2 ) is shown 
on Fig. 21. Accurate and direct measurements of 
Fl(x,Q 2 ) and other quantities at low x and/or Q 2 
(the predicted range and accuracy of Fl(x, Q 2 ) mea- 
surements at HERA III is shown on Fig. 21) would be 
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a great help in determining whether NNLO is suf- 
ficient or whether resummed (or other) corrections 
are necessary, or helpful for maximum precision. 

4 Conclusions 

One can perform global fits to all up-to-date data 
over a wide range of parameter space, and there are 
various ways of looking at uncertainties due to er- 
rors on data alone. There is no totally preferred 
approach. The errors from this source are rather 
small - ~ 1 — 5% except in a few regions of param- 
eter space and are similar using various approaches. 
The uncertainty from input assumptions e.g. cuts 
on data, parameterizations etc., are comparable and 
sometimes larger, which means one cannot entirely 
believe one group's errors. 

The quality of the fit is fairly good, but there are 
some slight problems. These imply that errors from 
higher orders/resummation are potentially large in 
some regions of parameter space, and due to corre- 
lations between partons these affect all regions (the 
small x gluon influences the large x gluon). Cutting 
out low x and/or Q 2 data allows a much-improved fit 
to the remaining data, and altered partons. Hence, 
for some processes theory is probably the dominant 
source of uncertainty at present and a systematic 
study is a priority as is more data which would help 
determine our theoretical accuracy. 
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